AITopics

Country:

North America > United States (1.00)
Europe (0.67)

Genre: Research Report (0.69)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Neural Information Processing SystemsFeb-16-2026, 01:10:59 GMT

7b871f53ba2ee540ef76e6220b462e16-Paper-Conference.pdf

data mining, machine learning, natural language, (20 more...)

Country:

Africa > South Sudan > Equatoria > Central Equatoria > Juba (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Santa Clara County > San Jose (0.04)
(2 more...)

Genre: Research Report > Experimental Study (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
(4 more...)

Mandal, Bibhabasu, Nandy, Sagnik

Privacy utility trade offs for parameter estimation in degree heterogeneous higher order networks

arXiv.org Machine LearningFeb-5-2026

In sensitive applications involving relational datasets, protecting information about individual links from adversarial queries is of paramount importance. In many such settings, the available data are summarized solely through the degrees of the nodes in the network. We adopt the $β$ model, which is the prototypical statistical model adopted for this form of aggregated relational information, and study the problem of minimax-optimal parameter estimation under both local and central differential privacy constraints. We establish finite sample minimax lower bounds that characterize the precise dependence of the estimation risk on the network size and the privacy parameters, and we propose simple estimators that achieve these bounds up to constants and logarithmic factors under both local and central differential privacy frameworks. Our results provide the first comprehensive finite sample characterization of privacy utility trade offs for parameter estimation in $β$ models, addressing the classical graph case and extending the analysis to higher order hypergraph models. We further demonstrate the effectiveness of our methods through experiments on synthetic data and a real world communication network.

artificial intelligence, differential privacy, machine learning, (17 more...)

arXiv.org Machine Learning

2602.03948

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Ohio > Franklin County > Columbus (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(3 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.46)
Health & Medicine > Therapeutic Area > Immunology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Neural Information Processing SystemsOct-10-2025, 06:56:49 GMT

Truthful High Dimensional Sparse Linear Regression

The problem involves two intertwined components: data acquisition and privacy-preserving data analysis.

agent, estimator, mechanism, (15 more...)

Country:

Africa > South Sudan > Equatoria > Central Equatoria > Juba (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Santa Clara County > San Jose (0.04)
(2 more...)

Genre: Research Report > Experimental Study (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Game Theory (0.93)
(3 more...)

Neural Information Processing SystemsOct-11-2024, 10:57:48 GMT

Differentially Private Learning of Structured Discrete Distributions

We investigate the problem of learning an unknown probability distribution over a discrete population from random samples. Our goal is to design efficient algorithms that simultaneously achieve low error in total variation norm while guaranteeing Differential Privacy to the individuals of the population.We describe a general approach that yields near sample-optimal and computationally efficient differentially private estimators for a wide range of well-studied and natural distribution families. Our theoretical results show that for a wide variety of structured distributions there exist private estimation algorithms that are nearly as efficient - both in terms of sample size and running time - as their non-private counterparts. We complement our theoretical guarantees with an experimental evaluation. Our experiments illustrate the speed and accuracy of our private estimators on both synthetic mixture models and a large public data set.

differentially private learning, private estimator, structured discrete distribution, (1 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.46)

Chadha, Karan, Duchi, John, Kuditipudi, Rohit

Resampling methods for Private Statistical Inference

arXiv.org Artificial IntelligenceFeb-11-2024

Releasing statistics using sensitive data can hurt the privacy of individuals contributing to the data (Narayanan and Shmatikov, 2008; Dick et al., 2023). Differential privacy (Dwork et al., 2006) is now a widely accepted solution for performing statistical analysis while protecting sensitive data. In the years since its release, researchers have made considerable progress in the development of differentially private estimators for a range of statistical problems such as mean estimation, median estimation, logistic regression (Asi and Duchi, 2020; Chaudhuri et al., 2011). However, deriving a conclusion from a single point estimate--whether an empirical mean or a classifier prediction-- without any consideration of uncertainty can lead to faulty, inaccurate decision-making (Gelman and Loken, 2013). To have any hope of making private statistical tools broadly applicable, we must build the requisite inferential tools. Constructing confidence intervals around a give point estimate is the most basic inferential task. We therefore develop tools to do so for a broad class of statistics of interest with differential privacy.

algorithm, confidence interval, estimator, (17 more...)

2402.07131

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > France (0.04)
Asia > Middle East > Jordan (0.04)

Genre:

Research Report > New Finding (0.66)
Research Report > Experimental Study (0.48)

Industry: Information Technology > Security & Privacy (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.64)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.35)

Asi, Hilal, Ullman, Jonathan, Zakynthinou, Lydia

From Robustness to Privacy and Back

arXiv.org Artificial IntelligenceFeb-3-2023

We study the relationship between two desiderata of algorithms in statistical inference and machine learning: differential privacy and robustness to adversarial data corruptions. Their conceptual similarity was first observed by Dwork and Lei (STOC 2009), who observed that private algorithms satisfy robustness, and gave a general method for converting robust algorithms to private ones. However, all general methods for transforming robust algorithms into private ones lead to suboptimal error rates. Our work gives the first black-box transformation that converts any adversarially robust algorithm into one that satisfies pure differential privacy. Moreover, we show that for any low-dimensional estimation task, applying our transformation to an optimal robust estimator results in an optimal private estimator. Thus, we conclude that for any low-dimensional task, the optimal error rate for $\varepsilon$-differentially private estimators is essentially the same as the optimal error rate for estimators that are robust to adversarially corrupting $1/\varepsilon$ training samples. We apply our transformation to obtain new optimal private estimators for several high-dimensional tasks, including Gaussian (sparse) linear regression and PCA. Finally, we present an extension of our transformation that leads to approximate differentially private algorithms whose error does not depend on the range of the output space, which is impossible under pure differential privacy.

algorithm, artificial intelligence, machine learning, (17 more...)

2302.01855

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > Nevada > Clark County > Las Vegas (0.04)
(5 more...)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.48)

arXiv.org Artificial IntelligenceSep-16-2022

Truthful Generalized Linear Models

Qiu, Yuan, Liu, Jinyan, Wang, Di

In this paper we study estimating Generalized Linear Models (GLMs) in the case where the agents (individuals) are strategic or self-interested and they concern about their privacy when reporting data. Compared with the classical setting, here we aim to design mechanisms that can both incentivize most agents to truthfully report their data and preserve the privacy of individuals' reports, while their outputs should also close to the underlying parameter. In the first part of the paper, we consider the case where the covariates are sub-Gaussian and the responses are heavy-tailed where they only have the finite fourth moments. First, motivated by the stationary condition of the maximizer of the likelihood function, we derive a novel private and closed form estimator. Based on the estimator, we propose a mechanism which has the following properties via some appropriate design of the computation and payment scheme for several canonical models such as linear regression, logistic regression and Poisson regression: (1) the mechanism is $o(1)$-jointly differentially private (with probability at least $1-o(1)$); (2) it is an $o(\frac{1}{n})$-approximate Bayes Nash equilibrium for a $(1-o(1))$-fraction of agents to truthfully report their data, where $n$ is the number of agents; (3) the output could achieve an error of $o(1)$ to the underlying parameter; (4) it is individually rational for a $(1-o(1))$ fraction of agents in the mechanism ; (5) the payment budget required from the analyst to run the mechanism is $o(1)$. In the second part, we consider the linear regression model under more general setting where both covariates and responses are heavy-tailed and only have finite fourth moments. By using an $\ell_4$-norm shrinkage operator, we propose a private estimator and payment scheme which have similar properties as in the sub-Gaussian case.

agent, artificial intelligence, machine learning, (18 more...)

2209.07815

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Ghassemi, Mohsen, Kreačić, Eleonora, Dalmasso, Niccolò, Potluru, Vamsi K., Balch, Tucker, Veloso, Manuela

Differentially Private Learning of Hawkes Processes

arXiv.org Artificial IntelligenceJul-27-2022

Hawkes processes have recently gained increasing attention from the machine learning community for their versatility in modeling event sequence data. While they have a rich history going back decades, some of their properties, such as sample complexity for learning the parameters and releasing differentially private versions, are yet to be thoroughly analyzed. In this work, we study standard Hawkes processes with background intensity $\mu$ and excitation function $\alpha e^{-\beta t}$. We provide both non-private and differentially private estimators of $\mu$ and $\alpha$, and obtain sample complexity results in both settings to quantify the cost of privacy. Our analysis exploits the strong mixing property of Hawkes processes and classical central limit theorem results for weakly dependent random variables. We validate our theoretical findings on both synthetic and real datasets.

hawke process, privacy, probability, (15 more...)

2207.13741

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > California (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report (0.64)
Instructional Material (0.46)

Industry:

Information Technology > Security & Privacy (1.00)
Banking & Finance (0.92)
Education > Educational Setting > Online (0.68)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.46)

Vepakomma, Praneeth, Pushpita, Subha Nawer, Raskar, Ramesh

Private measurement of nonlinear correlations between data hosted across multiple parties

arXiv.org Machine LearningOct-18-2021

We introduce a differentially private method to measure nonlinear correlations between sensitive data hosted across two entities. We provide utility guarantees of our private estimator. Ours is the first such private estimator of nonlinear correlations, to the best of our knowledge within a multi-party setup. The important measure of nonlinear correlation we consider is distance correlation. This work has direct applications to private feature screening, private independence testing, private k-sample tests, private multi-party causal inference and private data synthesis in addition to exploratory data analysis. Code access: A link to publicly access the code is provided in the supplementary file.

correlation, distance correlation, estimator, (11 more...)

arXiv.org Machine Learning

2110.0967

Country:

Asia > South Korea > Seoul > Seoul (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)

Genre: Research Report (0.50)

Industry: Information Technology > Security & Privacy (0.88)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Security & Privacy (0.88)
Information Technology > Data Science (0.68)